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ABSTRACT 

Cellular messenger RNAs (mRNAs) are associated 
to proteins in the form of ribonucleoprotein par- 
ticles. The double-stranded RNA-binding (DRB) 
proteins play important roles in mRNA synthe- 
sis, modification, activity and decay. Staufen is a 
DRB protein involved in the localized translation 
of specific mRNAs during Drosophila early 
development. The human Staufenl (hStaul) forms 
RNA granules that contain translation regula- 
tion proteins as well as cytoskeleton and motor 
proteins to allow the movement of the granule on 
microtubules, but the mechanisms of hStau1-RNA 
recognition are still unclear. Here we used a com- 
bination of affinity chromatography, RNAse-protec- 
tion, deep-sequencing and bioinformatic analyses 
to identify mRNAs differentially associated to 
hStaul or a mutant protein unable to bind RNA 
and, in this way, defined a collection of mRNAs spe- 
cifically associated to wt hStaul. A common 
sequence signature consisting of two opposite- 
polarity Alu motifs was present in the hStaul - 
associated mRNAs and was shown to be sufficient 
for binding to hStaul and hStaul -dependent stimu- 
lation of protein expression. Our results unravel how 
hStaul identifies a wide spectrum of cellular target 
mRNAs to control their localization, expression and 
fate. 

INTRODUCTION 

All along their life cycle, from initial synthesis in the 
nucleus to final degradation in the cytoplasm, messenger 
RNAs (mRNAs) are associated to many proteins to form 
ribonucleoprotein particles [for reviews see (1—3)]. Many 
of these proteins belong to the hnRNP and SR families 



and participate in the regulation of essentially all steps of 
the mRNA biological activity. In addition, other class of 
RNA-binding proteins, which contain a conserved 
double-stranded RNA (dsRNA)-binding domain 
(DRBD), also play important roles in mRNA synthesis, 
activity and decay (4). This class includes nuclear proteins 
like NFAR and RHA involved in transcription and the 
ADARs responsible for RNA editing, as well as cytoplas- 
mic proteins like PKR, PCT and Dicer, which are 
involved in the regulation of gene expression and stability. 
For most members of this family, the interaction with 
dsRNA is not sequence-specific as documented by the 
atomic structure of dsRNA bound to various RBDs 
(5,6). The DRB protein Staufen was initially identified in 
Drosophila melanogaster as a maternal factor essential for 
the proper localization of bicoid and oskar mRNAs during 
the formation of the anteroposterior axis (7,8). The cor- 
responding mammalian homologues Staufenl and 
Staufen2 participate in various aspects of the mRNA life 
cycle. Thus, human Staufenl (hStaul) (9,10) has a role in 
processes such as mRNA transport and localized transla- 
tion in polarized cells (11-13), stimulation of translation 
of specific mRNAs (14) and mRNA decay through a 
mechanism called Staufen-mediated decay (SMD), in 
which specific mRNAs are targeted to degradation by a 
protein complex including hStaul (15,16). In addition, 
hStaul has a role in infectious diseases, as it is important 
for the replication of RNA viruses such as HIV and influ- 
enza virus (17-20). The participation of hStaul in the 
processes indicated above takes place in the context of 
large ribonucleoprotein complexes or RNA granules that 
are functionally diverse and show a dynamic protein and 
RNA composition [for reviews see (21-23)]. These include 
(i) RNA transport granules, structures that contain cyto- 
skeleton and motor proteins as well as regulators of trans- 
lation such as PABP and FMRP, suggesting their role in 
RNA transport and localized translation (11-13,24), (ii) 
stress granules (25,26) and (iii) hStaul -dependent RNA 
decay complexes (15,16). 
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The observation that dmStaufen DRBD3 binds dsRNA 
without any sequence specificity (5), and yet specific fly 
mRNAs, like bicoid, oskar and prospero, are targets of 
dmStaufen indicates that the RNA signature required 
for dmStaufen recognition is still unclear. Likewise, 
hStaul binds RNA in vitro without apparent sequence 
specificity (9), but a large number of human mRNAs 
have been found associated to hStaul intracellular 
complexes in vivo, as reported by transcriptomic analyses 
of hStaul immunopurified samples (15,27) (S. de Lucas, 
unpublished results). However, for many of these hStaul- 
associated mRNAs the question remains whether they are 
directly recognized by hStaul or rather by other RNA- 
binding protein(s) present in the purified intracellular 
complexes. In this report we have analysed by deep- 
sequencing the RNAs present in affinity-purified hStaul 
intracellular complexes and compared with those 
associated to analogous complexes containing a mutant 
hStaul protein defective in RNA binding. Bioinformatic 
analyses of the sequence data identified mRNAs preferen- 
tially associated to wild-type hStaul complexes. On the 
other hand, deep-sequencing of the RNAs associated to 
RNAse-treated purified hStaul complexes indicated that 
most of the protected sequences mapped to the 3'-untrans- 
lated regions (UTRs) of the associated mRNAs and 
identified a sequence signature containing at least two 
opposite-polarity motifs analogous to human Alu se- 
quences. Functional analyses verified that this signature 
is important for mRNA association to hStaul complexes 
and plays a role in the efficient expression of the mRNAs. 

MATERIALS AND METHODS 

Biological materials 

The HEK293T (28) cell line was cultured as previously 
described (13) and transfected with DNA plasmids using 
the calcium phosphate method (29). Plasmids 
pChStaufen-TAP, pChMut-TAP and pC-TAP as well as 
rabbit and rat antibodies specific for hStaul have been 
previously described (13,18). 

Protein analyses 

The tandem affinity purification (TAP) of recombinant 
hStaufenl-TAP an Mut-TAP proteins has been previously 
described (13,18). Briefly, cell extracts were prepared in 
50 mM Tris-HCl pH 7.5, lOOmM NaCl, 5mM EDTA 
(TNE) buffer containing 0.5% NP-40, 1 mM dithiothreitol 
(DTT) in the presence of protease and RNAse inhibitors 
for 30min at 4°C. The lysates obtained were incubated 
with IgG-Sepharose (GE Healthcare) for 12 h at 4°C, 
washed in IPP-150 (150mM NaCl, lOmM TrisiHCl, 
0.1% Igepal, pH 8.0) buffer and digested with 1U of 
Tobacco Etch Virus (TEV) protease per 10 7 cells for 3h 
at room temperature. Supernatants were incubated with 
Calmodulin-agarose (Stratagene) resin for 12 h at 4°C. In 
the corresponding samples, RNAse Tl digestion was 
carried out for 1 h at room temperature before 
Calmodulin resin elution. After extensive washings, 
protein complexes were eluted in a buffer containing 
3mM EGTA (13). The purified proteins were analysed 



by electrophoresis in 10% polyacrylamide gel and either 
western blot or silver staining. 

RNA purification, cDNA synthesis and deep sequencing 

The RNAs associated to purified complexes were obtained 
by treatment of Calmodulin-agarose resin with 0.2mg/ml 
Proteinase K and 0.5% sodium dodecyl sulphate in TNE 
for 1 h at 37°C. After phenol extraction, RNAs were 
precipitated with 2 ethanol volumes and 20 ug of 
glycogen (Roche) and resuspended in diethyl 
pyrocarbonate (DEPC)-treated H 2 0. Before cDNA syn- 
thesis, each RNA preparation was monitored using the 
Agilent 2100 Bioanalyzer (Agilent technologies). RNA 
samples of hStaul-TAP (treated or not with RNAse Tl) 
and the untreated hStaul -Mut (lOOng each) were used to 
generate cDNA libraries according to Illumina mRNA- 
seq protocol at Fastens SA Company (Plan-Les-Ouates, 
Switzerland), but the poly (A+) RNA purification step 
was skipped. Furthermore, mRNA fragmentation was 
omitted in the Tl-RNAse-treated hStaul -associated 
RNA, as it already contained RNAs ~100nt in size. 
cDNA libraries were subjected to ends repair reaction, 
Illumina adapter ligation and further purified from poly- 
acrylamide gels. This purification step excluded the iden- 
tification of putative small RNAs present in the samples. 
cDNA libraries were sequenced at Fastens SA in an 
Illumina HiSeq2000 sequencer. Reads length were 100 bp 
in a single-read run cycle. 

Quantitative reverse transcriptase-polymerase chain 
reaction 

Quantitative reverse transcriptase-polymerase chain reac- 
tions (qRT-qPCRs) were used to determine mRNA and 
28s ribosomal RNA (rRNA) levels in both total cell 
RNAs and RNAs associated to TAP-purified hStaul 
complexes. First, cDNA was generated using the High- 
Capacity cDNA Reverse Transcription kit from Applied 
Biosystems. cDNA was then diluted to perform qPCR 
with SYBRGreen PCR Master Mix reagents (Applied 
Biosystems) and specific primers to detect mRNAs and 
28s rRNA (Supplementary Table S3). 

Plasmid construction 

The 3'-UTRs of MMACHC, RAB2B and FCF1 mRNAs 
were amplified using Titan One RT-PCR System (Roche 
Applied Biosystems) following the manufacturer instruc- 
tions. The specific primers used (Supplementary Table S3) 
contained an Xho I site (Forward primer) or a Kpn I site 
(Reverse primer) to facilitate the posterior cloning of RT- 
PCR products into the Xho I/Kpn I sites of pEGFP-Cl 
plasmid (Clontech Laboratories). This procedure 
generated the corresponding plasmids pEGFP-C- 
MMACHC, pEGFP-C-RAB2B and pEGFP-C-FCFl. 
To remove or invert the protected Alu repeats in the 
3'-UTR of GFP-MMACHC or GFP-FCF1 constructs, 
specific restriction enzyme sites were inserted flanking 
each of the Alu sequences using the Quick Change 
Mutagenesis Kit from Stratagene. Plasmids were then 
digested and religation performed with T4 polynucleotide 
ligase to obtain deletion mutants or clones with randomly 
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inserted Alu repeats that were then sequenced to deter- 
mine the Alu polarity. 

Bioinformatic analyses 

Deep sequencing data analysis. The output of the Illumina 
sequencer was composed of reads up to 100 nt long, some 
of which contained fragments of Illumina adapters' se- 
quences. These adapter sequences were removed and 
reads of 50-100 nt long were merged in a single fastq file 
per sample. Raw reads were aligned to representative 
human rRNA sequences (28S, 18S, 5.8S and 5S) with 
BWA version 0.5.9rcl (30) with default parameters. 
Those aligned with ribosomal sequences were not con- 
sidered for ulterior analysis steps. Filtered reads were 
aligned to human genome (hgl9) with BWA version 
0.5.9rcl with default parameters. After alignment, read 
counts that matched gene regions were normalized by 
the trimmed mean of M values method of edgeR 
package (31). Gene annotations and coordinates were 
obtained from UCSC database using the 'RefGene' 
table. Similarly, lincRNAs were identified using the 
UCSC lincRNA Transcripts database. Normalized 
counts per million of hStaul-wt and hStaul-mutl 
samples were compared and Fold Changes calculated, 
considering the hStaul-wt sample as control. Common 
dispersion and exact test for a negative binomial distribu- 
tion were used to determine the statistical significance of 
the results as described in edgeR. Finally, P-values were 
adjusted by FDR (32). Values of FDR < 0.05 and Fold 
Change < —2 were used as numerical thresholds for 
defining Stau (wt)-specific mRNAs. FIESTA viewer 
(http://bioinfogp.cnb.csic.es/tools/FIESTA) was used to 
visualize scatter plots, integrate all numerical results 
(counts per million, Fold Changes and FDR) and to 
evaluate the effect of applying different numerical thresh- 
olds to the RNAseq results. Integrative Genomics Viewer 
(IGV) 2.1 (33) was used to visualize reads aligned to 
human gene structures. The read_distribution program 
from RSeQC package (34) was used to classify and 
count aligned reads into specific genomic regions: coding 
exons, 5'-UTR, 3'-UTR and introns. 

Gene ontology classification 

The differentially represented mRNAs were analysed 
by DAVID Functional Annotation Bioinformatic Micro- 
array Analysis tool from the National Institute of Allergy 
and Infectious Diseases (NIAID-NIH) (35). 

Sequence alignment 

To determine the presence of common sequences within 
the identified protected regions, the reverse complement of 
each protected sequence was obtained. Both protected se- 
quences and their reverse complement sequences were 
aligned with MUSCLE (36). A preliminary short motif 
(M30nt) was extracted from the alignment. The 
presence of such motif in the protected sequences and cor- 
responding 3'-UTR was further assessed with Infernal 
(37). Visual inspection of the preliminary motif confirmed 
the presence of the characteristic poly-A segment joining 
FLAM and FRAM monomers in the Alu primary 
structure. To confirm the motif as a segment of a 



generalized Alu, protected sequences were analysed with 
RepeatMasker (http://www.repeatmasker.org). Alu repeat 
subfamilies were then identified in both protected se- 
quences and corresponding 3'-UTR sequences with 
RepeatMasker. A diagram showing the localization of 
Alu repeats and protected sequences in 3'-UTR was 
created with DOG 1.0 (38). A final full-length alignment 
of representative human Alu sequences (from 
RepeatMasker) and protected sequences in standard 
orientation was obtained with MUSCLE, using full-length 
Alu repeats in 3'-UTR sequences for optimal alignment. 
A Neighbourhood joining tree of protected sequences was 
obtained with Jalview (39). 

RESULTS 

To identify mRNAs specifically associated to hStaul and 
decipher the hStaul -specific RNA recognition site we used 
the experimental strategy described in Figure 1, which 
involved (i) affinity purification of either wt or RNA 
binding-defective hStaul intracellular complexes, (ii) 
deep-sequencing of the associated RNAs, (hi) bioinfor- 
matic filtering for differential mRNA presence in wt- 
versus-mutant hStaul complexes and resistance to 
RNAse, (iv) search for common features in the specific 
protected RNAs and (v) experimental validation and 
testing of functional relevance. 

Identification of human mRNAs specifically associated to 
hStaul 

The intracellular complexes containing either wt hStaul or 
a RBD3 mutant protein, which binds RNA with ~ 10-fold 
less affinity (mutl) and is expressed to levels similar to wt 
protein (Figure 2A; <10% difference in four independent 
experiments) (13), were purified using the TAP tag two- 
step affinity chromatography process. As previously 
reported, a complex pattern of proteins was found 
associated to hStaul (13) (Figure 2B, hStaul), while 
only background signal was observed for a control puri- 
fication in which only the TAP tag was expressed 
(Figure 2B, Crtl). Furthermore, similar protein compos- 
ition was found in complexes containing RBD3 mutant 
hStaul (Figure 2B, hMutl). To identify the RNA 
species interacting with hStaul protein and gain insight 
on the potential RNA interaction regions, an RNAse Tl 
treatment was included before final elution of either wt or 
mutl hStaul -TAP protein, to remove RNA sequences not 
protected within the complexes. The protein complexes 
obtained after washing away the RNAse showed a 
protein pattern similar to untreated hStaul complexes, 
as revealed by silver staining (Figure 2B). Essentially no 
RNA was detected associated to the control TAP 
complexes and reduced amounts were found associated 
to mutl- as compared with wt-hStaul. The size of 
RNAs associated to hStaul complexes was in the range 
of hundreds of nt, but it was reduced to ~100nt after 
RNAse Tl treatment (Figure 2C). Deep-sequencing was 
performed by the Illumina mRNA seq protocol on the 
RNA samples associated to wt hStaul (treated or not 
with RNAse) and to mutl hStaul, but the RNA 
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Figure 1. Diagram of the experimental approach. The main objective of the work and the various steps of the experimental approach are presented. 
The corresponding items in the paper are referred to at the right. 



samples obtained from TAP-expressing controls or 
associated to mutl hStaul and treated with RNAse did 
not contain enough RNA to be processed. 

The distribution of sequences corresponding to rRNAs 
and non-rRNAs are shown in Figure 3A. Consistent with 
the presence of ribosomal subunits in hStaul complexes 
(13), rRNAs were highly represented in wt hStaul 
complexes and this was also the case in mutl -associated 
RNAs, suggesting that the mutation in hStaul -RBD3 
is not sufficient to abolish ribosome binding to 
RNA complexes. However, RNAse treatment reduced 
~ 10-fold the percentage of rRNAs sequences detected, 
indicating that the association of rRNAs to the granule 
is more susceptible to RNAse treatment than the associ- 
ation of mRNAs. As rRNAs are normally more RNAse- 
resistant than mRNAs, these results suggest that RNAse 



treatment releases ribosomes bound to the complex 
through mRNA-mediated interactions. To further 
analyse hStaul -associated RNA, the rRNA reads were 
filtered out and the remaining sequence data were 
aligned to the human genome. A total of 3534 and 3733 
mRNAs were detected in the wt and mutl hStaul - 
associated RNAs, respectively, considering 10 as a 
minimal cut-off value for the number of reads per 
mRNA. A considerable over-representation of sequences 
corresponding to 5'-UTRs, 3'-UTRs and coding exons 
was observed for the reads from wt or mutl hStaul- 
associated RNA (Figure 3B), whereas intron sequences 
were under-represented, consistent with the cytoplasmic 
localization of hStaul complexes. 

To compare the number of reads aligned to RNAs 
associated to wt or mutl hStaul we used the coverage 
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Figure 2. Protein and RNA patterns after TAP purification of hStaul complexes. Cultures of HEK293T cells were transfected with plasmids 
expressing wt-hStaul-TAP, mutl-hStaul-TAP or the TAP tag as a control. The corresponding intracellular complexes were purified by affinity 
chromatography including or not and RNAse Tl treatment. (A) Western blot analysis of hStaul-TAP and Mut-TAP expression levels in total cell 
extracts. (B) Analysis by silver staining of the purified complexes. The size of the molecular weight markers is indicated to the left. (C) Analysis of the 
RNAs associated to each complex by Bioanalyser. 



of the wt hStaul sample as reference. In this way we 
identified 136 mRNAs (Supplementary Table SI) as well 
as 58 lincRNAs (Supplementary Table S2) under-repre- 
sented in the mutl hStaul -associated RNA (Fold 
Change < -2; FDR < 0.05), suggesting that the RNA 
binding capacity of hStaul is important for their associ- 
ation to the RNA complexes (Supplementary Table SI). 



In contrast, 79 mRNAs were found enriched in the mutl 
hStaul -associated RNA (Fold Change > 2; FDR < 0.05). 
These mRNAs would be associated to the mutant hStaul 
complexes by other RNA binding proteins and were not 
analysed further (Supplementary Table SI). To ascertain 
the potential specificity of the wt hStaul-mRNA associ- 
ation, we compared their Gene Ontology classification 
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Figure 3. Gene assignment of RNAs associated to hStaul complexes. 
The deep-sequencing data corresponding to wt- or mutl-hStaul 
associated RNAs were filtered by comparison with the human 
genome. (A) Proportion of sequence reads corresponding to ribosomal 
or non-rRNAs. (B) Proportions of non-rRNAs mapping to 5'-, 
3'-UTRs, coding exons or introns. 

(40) with the general transcriptome. The results are pre- 
sented in Supplementary Figure SI and show that the 
mRNAs preferentially associated to wt hStaul are 
enriched in specific functional, molecular function or cell 
compartment categories, like protein translation, 
ribosome or ribonucleoprotein complexes. These results 
suggest that the set of hStaul -associated mRNAs is not 
a random representation of the human transcriptome but 
rather a specific group of mRNAs. To experimentally 
validate the results obtained by deep sequencing, the 
presence of selected mRNAs in purified wt hStaul 
complexes was ascertained by RT-qPCR, using mRNAs 
not detected in the hStaul complexes as controls. As 
shown in Figure 4, the FCF1, BPNT, MAGT1 and 
MMACHC mRNAs were over-represented in wt hStaul 
as compared with mutl -associated RNA samples, whereas 
the ANKRD10 mRNA used as a control did not associate 
to either complex. 

A sequence signature is recognized in hStaul-associated 
mRNAs 

Deep sequencing technology provides an extremely 
powerful tool to identify the mRNAs present in hStaul 
complexes, and also to determine the specific regions of 
the mRNA involved in RNA-protein interactions, when 
combined with RNAse protection assays. The Integrative 
Genomics Viewer (IGV 2.1) (33,41) was used to visualize 
the localization of reads in each of the 136 mRNAs spe- 
cifically associated to wt hStaul. Among these target 
mRNAs, 51 showed at least one well-populated protected 
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Figure 4. Verification of mRNA association to hStaul complexes. 
Either total cell RNA of RNAs associated to wt-, mutl -hStaul or 
TAP control purified complexes were used for RT-qPCR using 
primers specific for FCF1, BPNT1, MMACHC or MAGT as represen- 
tative of mRNAs preferentially associated to wt-hStaul complexes or 
ANKDR10 as a non-associated mRNA control. The concentration of 
each mRNA is compared with that found in total cell RNA as a ref- 
erence. The values represent averages and standard deviations of three 
experiments. 

region 70-300 nt in length when the hStaul -associated 
RNAse-treated RNA sample was analysed (see 
Supplementary Table SI and Supplementary Figure S2 
as an example), but the remaining 85 mRNAs did not 
contain sufficient protected reads to warrant a correct 
analysis and were not considered further. In contrast, 
among the 58 lincRNAs specifically associated to wt 
hStaul only three contained a protected region 
(Supplementary Table S2; see example in Supplementary 
Figure S3). Whereas the sequence reads of these 51 
mRNAs specifically associated to hStaul localized 
mainly to coding exons and 3'-UTRs, the corresponding 
reads in the RNAse-treated RNA localized mainly to the 
3'-UTRs (Figure 5A; see Supplementary Figure S2 as an 
example). All together, these results suggest that the 
RBD3-dependent interaction of these mRNAs to hStaul 
complexes takes place through RNA signatures localized 
at their 3'-UTRs. 

Alignment of the RNAse-protected sequences found in 
the 51 mRNAs specifically associated to wt hStaul 
showed the presence of a conserved sequence motif 
~130 nt in length with a high homology among the 
various protected regions (Figure 5B). Searches of the 
human genome using the conserved sequence motif 
identified Alu repetitive elements abundantly present in 
the human genome as the best hit (Supplementary 
Figure S4; see Supplementary Figure S5 for a clustering 
analysis of the protected sequences). Thirty-one of the 
hStaul -associated RNAs contained more than one of 
these sequence motifs (Supplementary Table SI; 
Figure 6A) and the most prevalent pattern is to find at 
least two Alu motifs in opposite orientations, a pattern 
which we call an Alu-signature (Figure 6B; 
Supplementary Figure S6) and this was also the case for 
one of the lincRNAs specifically associated to hStaul 
complexes. Furthermore, such an Alu-signature was sig- 
nificantly more abundant in the wt- versus mutl- 
associated mRNAs (Supplementary Table SI and 
Supplementary Figure S7). Of note, a search for Alu re- 
petitive elements within the hStaul -associated RNAs 
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revealed the presence of additional Alu sequences that 
were not RNAse-protected within the hStaul intracellular 
complexes (Figure 6B; Supplementary Figure S6). 

Functional relevance of the hStaul-specific sequence 
signature 

To test the potential relevance of the sequence signature 
identified above for the functionality of the hStaul - 
associated mRNAs, the 3'-UTRs of a selected gene set 
including FCF1, MMACHC and RAB2B were fused 
downstream of a GFP reporter driven by a strong consti- 
tutive promoter (Figure 6B) and used for transfection ex- 
periments in human HEK293T cells. The consequences of 
including these 3'-UTRs for GFP mRNA association to 
hStaul were analysed by over expression of hStaul-TAP 
and affinity purification of the intracellular complexes. A 
5- to 7-fold increase of specific GFP mRNA association to 
hStaul was observed for all 3'-UTRs tested when 
compared with a GFP control mRNA (Figure 7A). 
Under the conditions used the association of GFP 
mRNAs to hStaul was 10- to 100-fold higher than that 
found for a TAP control (Supplementary Figure S8). To 
analyse the consequences of hStaul over expression for 
the fate and expression of the various recombinant GFP 
mRNAs, the accumulation of GFP protein and mRNA 
were determined in total cell extracts. The results are pre- 
sented in Figure 7 and indicate that the amounts of GFP 
mRNAs containing any of the selected 3'-UTRs did not 
change by increasing the cellular levels of hStaul (Figure 
7B). However, the accumulation of GFP protein was 
enhanced by 2- to 3-fold only when the selected 3'-UTRs 
were included (Figure 7C, D). These results indicate that 
the 3'-UTRs of the mRNAs associated to hStaul are 



relevant for the binding to the complexes and for the 
expression of the encoded genes. 

Next, the structure-function relationship of the 
sequence signature was analysed by mutation. The pro- 
tected Alu motifs present in the 3'-UTRs of MMACHC 
and FCF1 were either deleted (Figure 8 A) or inverted 
(Figure 8C) in the GFP constructs, and hStaul -associ- 
ation experiments similar to those described in Figure 7 
were performed. In the case of GFP-MMACHC mRNA, 
deletion of either Alu motif or double deletion resulted in 
significantly reduced mRNA association to hStaul 
complexes (Figure 8B). Inversion of Alu I also reduced 
association, but inversion of Alu II motif only caused a 
small, not significant, decrease. These results could 
mean that the third Alu motif, normally not protected 
(Figure 6B and Supplementary Figure S6), might rescue 
flipping of Alu II motif. Opposite results were obtained for 
the GFP-FCF1 construct. The association to hStaul 
complexes was either unaltered or enhanced as 
compared with wt when Alu I or Alu II were deleted or 
flipped (Figure 8C, D). 



DISCUSSION 

The localization and regulated translation of mRNAs is a 
general mechanism to allow the spatio-temporal regula- 
tion of protein expression that is used in all Eukarya, 
from yeast to humans (42,43). It is important for a 
variety of processes, including embryonic development, 
cell differentiation and motility and synaptic plasticity 
(44^17). Although it was originally described for a small 
number of specific mRNAs, like ASH1 in yeast, bicoid in 
the fly, P-actin in fibroblasts or Vgl in Xenopus (8,47^19), 
it has later been recognized as a mechanism affecting a 
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Figure 7. Association to hStaul and gene expression of recombinant 
GFP-UTR mRNAs. The plasmids expressing the GFP ORF linked to 
MMACHC, FCF1 or RAB2B 3'-UTRs were cotransfected into 
HEK293T cells with either hStaul-TAP or TAP expressing plasmids. 
As control, an irrelevant UTR present in pEGFP-Cl plasmid (248 nt) 
was used. (A) The intracellular complexes were purified by affinity 
chromatography and RNA was isolated from either total cell extracts 



high proportion of cellular mRNAs (50,51). The Staufen 
protein is a key player in the localization and regulated 
translation of specific mRNAs in several organisms 
(8,13,16,49,52,53), but the sequence/structure signatures 
it uses to identify the target mRNAs are far from clear. 
Only recently the structures of the recognition signatures 
have been elucidated for D. melanogaster K10 and oskar 
mRNAs (54,55) and for a subset of human mRNAs that 
undergo SMD (16). In the present report we took an 
unbiased approach to uncover hStaul target mRNAs 
based on the use of a mutant protein essentially unable 
to bind RNA. A combination of affinity purification, 
RNAse protection and deep-sequencing (Figure 1) 
allowed the identification of a long list of mRNAs specif- 
ically associated to hStaul and the recognition of 
a minimal binding signature defined by two inverted 
Alu-like sequences located within the mRNA 3'-UTR 
(Figures 2-6). This is in contrast with the situation for 
mRNAs target of SMD, for which a single Alu-like 
sequence has been found in the 3'-UTR and the implica- 
tion of Alu-containing IncRNAs has been reported (16). 
The experimental evidence presented here indicates that 
the specific mRNAs identified are associated to hStaul 
(Figure 4) and that their 3'-UTRs are sufficient for asso- 
ciation to hStaul and for hStaul -dependent stimulation of 
protein expression in vivo (Figure 7). Furthermore, 
deletion or inversion of the Alu-like sequences protected 
in the 3'-UTR of MMACHC gene reduced hStaul asso- 
ciation (Figure 8). These results suggest that for this target 
mRNA both inverted Alu-like sequences are necessary 
and may act in cis, as their orientation is critical. The 
situation is different for the FCF1 target mRNA, as 
similar mutations did not affect hStaul association 
(Figure 8). However, it should be mentioned that FCF1 
3'-UTR contains an additional, non-protected Alu 
sequence between those protected (Figure 6B; 



Figure 7. Continued 

or purified complexes. The accumulation of 28S rRNA and GFP 
mRNA was determined by RT-qPCR. The amount of 28S rRNA 
was used to standardize the values of GFP mRNA in each sample 
and the ratio of GFP mRNA present in the purified complex to that 
present in the original cell extract was determined. The results are pre- 
sented as relative values using those of GFP linked to a control UTR 
as reference and represent averages and standard deviations of 4-10 
experiments (A.U., arbitrary units). (B) The accumulation of 28S 
rRNA and GFP mRNA was determined by RT-qPCR in total 
extracts of cells cotransfected with either a TAP (T) or a hStaul- 
TAP expressing plasmid (S). The amounts of GFP mRNA were 
standardized by the 28S RNA content, are referred to the values of 
control samples and represent averages and standard deviations of 4-8 
experiments. (C) GFP protein accumulation was determined by western 
blot of total extracts of cells cotransfected with either a TAP (T) or an 
hStaul-TAP expressing plasmid (S). Western-blot signals were 
quantified with ImageJ. The amounts of GFP were standardized by 
the actin content of the extracts, are referred to the values of control 
samples and represent averages and standard deviations of 5-8 experi- 
ments. (D) Representative experiment showing the expression and puri- 
fication of hStaul complexes, as well as GFP expression. Endogenous 
and recombinant hStaul protein expression is shown (first panel), using 
actin as a loading marker (second panel). The purified hStaul complex 
is shown (third panel) as well as the accumulation of GFP (forth 
panel). The numbers in (A, B and C) denote the statistical significance 
as determined by the Student t-test. 
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Figure 8. Structure and phenotype of signature mutants. (A) Structure of deletion mutants of recombinant GFP-3'-UTRs (see Figure 6B). (B) The 
relative GFP mRNA binding to hStaul complexes was determined as indicated in Figure 7A and the results are presented using the values for the 
corresponding wt GFP-3'-UTR construct as a reference. The values represent averages and standard deviations of three experiments. (C) Structure of 
inversion mutants of recombinant GFP-3'-UTRs (see Figure 6B). (D) The relative GFP mRNA binding to hStaul complexes was determined as 
indicated in Figure 7A and the results are presented using the values for the corresponding wt GFP-3'-UTR construct as a reference. The values 
represent averages and standard deviations of three experiments. The numbers in (B and D) denote the statistical significance as determined by the 
Student t-test. 



Supplementary Figure S6). Such organization could 
hamper the formation in cis of the hStaul signature and 
may require the participation of trans-acting IncRNAs 
containing Alu-like sequences, as reported for hStaul -de- 
pendent decay (16). A parallel screening of the deep- 
sequencing data using the lincRNA databases indicated 
the specific association of some of these RNAs to 
hStaul complexes and a small fraction of these showed 
the presence of Alu-like sequences (Supplementary Table 
S2 and Supplementary Figure S3). Alternatively, forma- 
tion of the sequence signature in FCF1 mRNA may 
involve intermolecular signals generated by trans- 
interaction of various mRNAs, as reported for bicoid 
mRNA in the fly (56). 

The Alu sequences are members of the short 
interspersed elements (SINEs) that appeared in the 
primate ancestor some 65 Myears ago. They are typically 



~300 nt long and arose as duplication of the 7SL signal 
recognition particle RNA. The human genome contains 
~1 million Alu copies, mainly located at non-coding 
regions, which can be grouped in >200 subfamilies (57). 
They can be transcribed from an internal RNA pol III 
promoter or from nearby RNA pol II promoters and 
have important implications in genome evolution by 
gene rearrangements, mutations due to retrotransposition 
and induced recombination. In addition, important effects 
of Alu sequences have been reported in the modulation of 
gene expression, like alterations in transcription efficiency, 
alternative splicing, premature polyadenylation or mRNA 
editing (58). Here we show that a specific arrangement 
of Alu-like sequences can act as hStaul binding site 
(Figure 7). It has been shown that a signature of two 
inverted Alu sequences reduces mRNA translation (59). 
Here we further report that such a signature determines 
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hStaul -dependent enhancement in gene expression, inde- 
pendent of mRNA stability (Figure 7). Our results confirm 
and extend the recent publication by Elbarbary et al. (60) 
and suggest that recognition of the Alu signature by 
hStaul protein allows efficient mRNA nucleocytoplasmic 
transport and incorporation to the associated translation 
machinery. 

Given that Staufen is a protein that has been partly 
conserved from the fly and Xenopus to humans, it might 
be considered unexpected to find that a combination of 
Alu sequences is used by hStaul to identify target 
mRNAs, as these were only acquired recently in genome 
evolution. However, it should be noted that the specific 
mRNA binding sites of D. melanogaster bicoid and oskar 
mRNAs, bona fide targets for recognition by Staufen, 
contain stem-loop structures structurally similar to those 
expected from proximal inverted Alu sequences. In 
addition, a recent analysis of the mRNAs associated to 
Stau in D. melanogaster embryos, as detected by co- 
immunoprecipitation with recombinant or endogenous 
Stau protein, revealed structure motifs like mismatches, 
internal loops and bulges within RNA stems that could 
confer specificity to Stau binding (61). Such secondary 
structure motifs can also be predicted in the folded Alu- 
signatures described for hStaul mRNA targets (data not 
shown). On the other hand, inspection of the 3'-UTRs of 
the mouse FCF1, MMACHC and RAB2B mRNAs indi- 
cates that the latter two genes also contain mouse SINE 
elements. Hence, our results suggest that mammalian 
Staul proteins have taken advantage of the presence of 
abundant SINE elements to secure recognition of specific 
mRNA targets. 
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